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(-h ! Abstract 
-i— > ■ 

The main purpose of this paper is to estimate the regression function by using 
a recursive nonparametric kernel approach. We derive the asymptotic normality for a 
£N) ■ general class of recursive kernel estimate of the regression function, under strong mixing 

" conditions. Our purpose is to extend the work of Roussas and Tran |17| concerning the 

in 



Devroye- Wagner estimate. 
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><! ■ 1 Introduction 



In this paper we consider nonparametric sequential estimation of a regression functional, 
for dependent observations. Regression function estimation is an important problem in 
data analysis and remains a subject of flight interest, which covers many applied fields 
such as prediction, econometrics, decision theory, classification, communications and control 
systems. The literature on this topic is still growing and some relevant work on this subject 
include the monographs by Prakasa Rao 1 14). Gyorfi et al. [10] and Yoshihara [21], while more 
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recent results are presented in, for example, the books by Gyorfi et al. j X X j and Bosq and 
Blanke [4J. Sequential estimation is achieved with the use of recursive estimators, typically 
kernel ones, and the purpose of this paper is to study a certain class of them. An estimator 
is said to be 'recursive' if its value calculated from the first n observations, say f n , is only 
a function of / n _ x and the n th observation. In this way, the estimator can be updated 
with each new observation added to the database. This recursive property is clearly useful 
in sequential investigations and also for a fairly large sample size, since addition of a new 
observation means the non-recursive estimators must be entirely recomputed. Besides, we 
are required to store extensive data in order to re-calculate them. 

The first kernel recursive regression estimator was introduced by Ahmad and Lin [1] 
taking the form 

t YiK (5 



x-Xi 



i=l v 



which is a recursive version of the Nadaraya- Watson estimate. Also Devroye and Wagner [7] 
propose the recursive estimator of the form 



n , 

V ¥±K x ~ x * 
i=l x 

n / 

V ±K { x ~ Xt 



8=1 

In the literature r^ L (x) and r^ w (x) are respectively the so-called recursive and semi-recursive 
estimators. Various results on the latter estimators were established in an independent 
and identically distributed (i.i.d.) case, by many authors, we cite, among many others, 
Ahmad and Lin [I], Devroye [6], Greblicki and Pawlak [9], Krzyzak [12] and Walk [19]. In 
the dependent case, the majority of works are focused to Devroye- Wagner estimate. In a 
context of strong mixing processes, Roussas [TB] gave the uniform almost sure convergence 
for r® w (x), and Roussas and Tran [T7] showed its asymptotic normality. Under (p- mixing 
conditions, Qin [15J showed the asymptotic normality of r% (x), and Wang and Liang |20| 
studied the almost uniform convergence for truncated versions of r^ w (x) and r^ L (x) in 
the same context. It should be noted that, unlike to the iid case, more results are only 
obtained for r® w (x) in dependent case. In particular no asymptotic normality has so far 
been established for r^ L (x) in this context. Also we remark that, the approach utilized 
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by Roussas and Tran |17| to establishing the asymptotic normality of r® w (x) cannot be 
generalized step by step to r^ L (x). Indeed, the adaptation of their proof to r^ L (x), needs to 
suppose that the sequence - Y^i=\{hi I l h n ) 2d converges to a finite limit, for the study of a few 
covariance terms. The earlier condition is not satisfied by the popular choice h n = cn~^+ i 
for d > 3. Also their proof uses the fact that for all % = 1, . . . , n h n < hi, while the same 
approach applied to r^ L (x), leads to assume that h n > hi, which contradicts the optimal 
choice of h n . 

This paper deals with an extension of the work of Roussas and Tran [T7] to the general family 
of recursive estimators introduced by Amiri [3], whose r^ w (x) and r^ L (x) are special cases. 
The paper is organized as follows. In the next section, we present our main assumptions and 
the result for regression estimation. The proof of the main result is postponed until Section 

El 



2 Sequential regression estimation 
2.1 Notation and assumptions 

Let {{X t , Y t ),t G N} be a sequence of random variables on probability space (Q, J 7 , P), taking 
values in R d x R d ' (d>l,d'>l), and having probability density function f(x,Y) with respect 
to the Lebesgue mesure. Assume that m is a Borelian function on W 1 ' into R such that 
u i — y m 2 (Y t (u)) is P-integrable, and define the regression function as 



S R *> m(y)f (x ,Y){x,y)dy ^ . 

Em(Fo), if f(x) = 



E (m (Fo) |X = x) = ill^g^^ ;= m if f{x) > o 



where / is the probability density function of X . Note that the transformation m is cho- 
sen by the statistician leading to multiple choices of estimation. Typical examples of m 
are identity and polynomial functions to estimate respectively the usual regression and the 
conditional moments. 



Throughout the paper we suppose that /, (p G Cj(6), where Cj(6) denotes the set of 
twice-differentiable functions, with bounded second derivative. This condition is classical 
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in the area of nonparametric estimation and has been used, among others, by Roussas and 
Tran [IT], Bosq and Blanke |3]. 

To estimate the functional r(x), we consider the general family of kernel regression esti- 
mators introduced in Amiri |3j, defined by 

"m(Y t ) fx-Xi 
E —Td^ K 



it \ i=i hf e \ hi 



Eta* 

i=i n i 



x-X„ 



which can be computed recursively by 



r e n {x) 



E hf-^ <pU(x) + (± hi {1 - £) ) m(Y n )Ki (x - X n ) 
(e hf^ fUW + (g hf 1 ^ Ki {x - X n ) 



where 

J 



1 V miXi) K ( X ~ X * \ ft( T ) ■= 1 V J_ K ( X ~ X * \ 



i=l i=l 



and ICf (•) := — K ( t- ) • Our class of estimates includes the popular kernel recursive 

hf E hf 1 "" v l ' 



3=1 



estimators r£ L (x) and r!^ w (x), corresponding to the cases £ = and £ = 1, respectively. 

At this point, we make some assumptions and give the main theorem. Throughout this 
paper the kernel K is assumed to satisfy the following conditions. 

Assumption HI. 

(i) K : M. d h-> R is bounded, symmetric and positive function such that L d K{t)dt = 1; 

(ii) lim \\x\\ d K(x) = 0; 

(iii) f Rd \viVj\K(v)dv < oo, i, j = 1, . . . , d. 



4 



Assume the sequence h n satisfies the following conditions. 

Assumption H2. 

(i) h n |0,n/^+ 2 ^oo; 

(ii) For all r e] - oo, d + 2], B n>r := \ ( j£ J ->• & > as n oo; 

i=i ^ ' 

(iii) For each sequence of integers u n and t>„ such that u n ~ u n , then h Un ~ /i, 

Assumption H3. 

(i) The process (X t ) is a-mixing with 



oix{k) <jk p , k > 1,7 > and p > max I 2, 




(ii) For each couple (s,t), s ^ t, the random vector (X s ,X t ) admits a probability den- 
sity function f {Xa ,Xt) sucn that sup || g s>t || 0C) < oo, where g s>t {-,-) := /(x Sl x t )(-, •) - 

|*-*|>i 

/(•)/(•)• 
Assumption H4. 

(i) The function E(m 2 (Y)|X = •) /(•) is both continuous and bounded away from zero at 

x; 

(ii) There exist A > 0, 6 > such that Eexp(A|m(F )H < oo; 

(iii) For each k ^ k', the random vector (X&, Y^, X&, Yk') admits a probability density 

function f(x k ,Y k ,x k ,,Y k ,), such that sup sup J Rd , J Rd , \G kjk > (s,u,t,v)\dudv < oo, 

|fc-fc'|>i ( s ,t)eR 2d 

where 

Gk,k' ("> "> ') ') = f(X k ,Yk,X k ,,Y k >) ("> "> "> ') _ f(X,Y) ('J ') /(X,Y) ") • 
If a n and b n are two real sequences, a n ~ 6 n means that the ratio a n /b n converges 1 
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Assumptions H.l and H.3 are classical in a nonparametric estimation field and they are 
the same as those classically used in the nonrecursive case. The first is satisfied by Gaussian 
and Eipanechnikov kernels, while the latter is checked by linear processes, as soon as / is 
bounded. Note that H.l(i)-(ii) are technical conditions, the first allows the cancellation of the 
first-order term of Taylor development in the computation of the bias term, while the latter 
ensures the existence of the second-order term. Much more should be said about assumption 
H.2. This latter is particular to the recursive problem and is clearly unrestrictive since the 
choice h n = C n rT v ', with C n \, c > 0, and < v < 1 is a typical example of bandwidth 
satisfying H.2. Concerning H.4, the condition H.4(ii) is clearly checked if m is a bounded 
function, and implies that 

E ( max |m(Y<)| p ] = O ((lnn) p/e ) , for all p > l,n > 2. 

Vl<i<n J 

The earlier condition was used by Bosq and Cheze-Payaud [5] to study the mean square 
error of the Nadaraya- Watson estimator. Assumption H.4(iii) was used by Roussas and 
Tran [T7] to study the asymptotic normality of r® w (x). Now, we can give the main result. 

2.2 Main result 

Let us set 

Bn kn Pqi-Q 2 x (dxidxj + 2 dxi dxj ) J Rd v i v j K ( v ) dv - 

The pointwise asymptotic gaussian distribution for our class of nonparametric recursive 
regression estimate is given in Theorem [1] below, and will be proved in Section [3l 

i 

Theorem 1 When the assumptions H.l — H.4, hold, if for all p > 0, (\nn) B h v n — > 0, as 
n — > oo, then 

y/nhi [r l n (x) - r(x) - B n ] 4 M 
for all x such that f(x) > 0, where 

aj( x ) = ^~ 2e) f(x) I K 2 (x)dx and V(x) = E [m 2 (Y )\X = x] - r\x). 



o. 



■(x)V(x) 



as n — )■ oo, 
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One may derive a simpler version of Theorem [T] by using an additional assumption which 
allows the cancellation of the bias term B n . 

Corollary 1 Under the assumptions H.l — H.4 and if nh^ +4: — > as n — y 0, then 
\Jnh d n [r l n {x) - r(x)] -4 M 

for all x such that f(x) > 0. 

Corollary [1] is an extension of the Rousssas- Iran's [T7] result on Devroye- Wagner estimate 
to the general family of recursive estimators r l n [x) for which the Devroye- Wagner estimate is 
especial case. The condition n/t^ +4 — > as n — > 0, implies that (Inn)* h v n — > 0, for all p > 0, 
and it is satisfied by the choice h n = C n n~ u , with C n 1 c > and + 4) < z/ < l/(d + 2). 
Let us mention that H.2(iii) will play a key role in our methodology, in particular when we 
prove the negligibility of some covariance terms for < £ < (d — 2) + /2, but it is not necessary 
if £ > 1/2. Also if £ > 1/2, our results can be established for p > 2. So, we observe that 
the estimators built with 'small' values of £ allow some restrictions on the smooth parameter 
h n and the strong mixing coefficient. However, as shown in Amiri [2J, these estimators are 
preferable than those built with 'large' £ in terms of small variance criterion. 
In practice, the constants of variance appearing in Theorem [1] need to be estimated. To this 
end, one may consider the simple Gaussian kernel and replace f(x) by fn( x )- There are 
many possibilities for constructing a consistent conditional variance estimate, one may use 
the functional kernel regression technique. 

In order to prove Theorem [H let us consider using the following decomposition. 

ri(x) - r(x) = [f£(x) - r(x)] + [r e n {x) - r e n (x)] , 
where r e n (x) = ip e n (x) / f^(x) , (p n { x ) being a truncated version of <p n {x) defined by 

i=i 

with b n , a sequence of real numbers which goes to +oo as n — > oo. Next, we need the following 
preliminary lemmas. 

Lemma 1 When the assumptions H.l and H.2 hold, then for all £ G [0, 1] 



P(x) 



, as n — )■ oo, 
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(a) 



(b) 



h~ 4 [Ef n {x)-f{x)\ 

K* W n {x) - <p{x)] 

where, if h £ C^(b), we set 



Pd(l-£)+2 
Pd(l-t) 



b 2 f (x) as n — > oo; 



b (x) as n — > oo, 



d 2 h 



(x) / ViVjK(v)dv. 



2 ^— ' dxidxj 

l<i,j<d J 

(c) Moreover if H.3 holds, then 

nh n Varf^(x) — >• cr|(x), as n — >■ oo, 

for all x such that f(x) > 0. 

Proof. The results (a) and (c) of Lemma [Dare obtained in Amiri [2], while (b) can be 
established in the same manner as (a) by substituting / by (p. □ 

Lemma 2 When the assumptions H.l — H.4 hold, then for all £ G [0, 1] 

(a) 

nh^Var(p n {x) — > cr 2 (x) \r 2 (x) + V(x)\ , as n — > oo. 

(b) 

nh d n Cov[f n {x),f n (x)] -> o- 2 (x)r(x) as n ->■ oo. 

Proof, 
(a) Let us set 



V;* =J2 EZ *kl Where = Wn.i-EWn.i, with W n>i 



K 



x — Xi 
hi 



m{Yi) lUm^Kbn} 



The variance of <f n (x) can be decomposed in variance and covariance terms as 

1 



Var<^ n (x) 



n 2 h m-t) B 2 



n n 



Vn + Cov (z fc>n , Zw >n ) 
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Concerning the variance term one may write 

v* nhi 



nht (1 2t) B n>d(1 _ 2l) 7 £ h d(i-e)V £ 



- hf M EK> M m 2 (r )l {MYo)|>M 

E 2 ^ (^f ) ^o)l {M y,)|< 6 „}} =: £1 + ^2 + ^3. 



Assumptions H.4(n), (in), the dominated convergence theorem and Bochner's lemma 
imply that 

l_ R2 [y( u ) + r 2^)j _^ + ||^||2 ? &g fc ^ ^ 

On account of the above, assumption H.2(m) and the Toeplitz lemma allow to deduce 
that 



nK ± [hf~ n) / R < ^K 2 (ag*) + r 2 ( M )] /(„)d« 



^a 2 (:r) [\/(x)+r 2 (x)j, 



as n — y oo. Concerning the term _D 2) if &n = (51nn) e with 5 > |, then using the 
assumptions H.2(ii) and H.4(ii), with the help of Markov's inequality, we have 

1 n 

\\K\&{E m HY )P(\rn(Y )\>K)ynhZZ^ 2M \\K\^{E m HYo)P(\ m (Y )\> bn )}i B n ^ 2M 



\D 2 \ < 



= O 



cxp 



(lnn)# B n _ 2M 



< 



hd r2 

"™ n,d(l-f) 



hd R i 



— y 0, as n — y oo. 



Next for the last term D3, from H.2(i)-(iii) and the logarithmic choice of b n , one may 
write 

2 



\D*\ < 



b 2 n nh d n 



n 2 h m-t) B 2 



k=i ^ ^ 



x-Xj 
hi 



O (hnb^) — y 0, as n — y +00. 



Therefore 



V* ~ n/if - 2 %!_ 2 ,)/(:r) + r 2 (x)] / ^ 2 ( M )^, 



as Ti — y 00. 



It follows that 

nhtV* 



n 2 h m ~ e) B 2 

lb tb n - D „ i d(i-£) 



Pd(i-2i)f(x) [V(x)+r 2 (x)} 



A 2 



d(l-t) 



K 2 (u)du, as n — > oo. 



Now, let us show that the covariance term of Vemp e n (x) is negligible. To this end, define 
a sequence c n of real numbers tending to zero as n goes to infinity, and write 



n rt 



E E covfz 

k=1 k^k' k' = l ^ \'=lj>j j=l '=lj>j j=l 



n n,d(l — t) 



< 



cn n— 1 n 

2( E E ^i+p,p+ E E ^-i+p,p 

K i=lp=l i=c n+1 p=l y , , 



E^ 

i = l 



d(l-f) 



where 



.4 



Cov 



K 



x-X, 



i+p 



h 



i+p,p 



i+p 



X — X 

m (Y i+P ) l{\ m (Y l+p )\<b n }, K ( — - — - 



m {Yp) l{| m (y p )|<b„} 



Udt Udl 
IL i+p IL p 



On one hand, the Billingsley inequality (see e.g., Bosq Blanke jl]) implies that 

A i+P>p < Ab 2 n a x (k) \\K\\lhTf p h; M , 
and then, it follows from assumptions H2(ii) and H.4(iv) that 



s»i E E «x{k)h-«lh- M zbiiWKfSt t^h-t l k h- p M 8^7ll^lL — x E W- 

£ < fc = Cn + lP=l < k = c n p=l ^ P 1 \ "n , 



< 



Hence 



iwqE^ 



E 



E /« 



d(l-<) 



/?./ 



n/^ 2 = O (fe 2 ^/^) . 
On the other hand, regarding about L±, one has 



A . 

^■i+p,p 



JiRd Im.d J R d' J R d' ^ (h i+ p) ^ (lip* 



X-A m (") 1 {|m( M )|<t, n }"^(^)l{|m(i.)|<& n }Gi+p,p(^»".*»^) 



dsdtdudv 



< 



bl(h k+p h p ) sup sup / d , / d , |G fcife /(s,«,t,i;)|d«du 

fc-fc'|>i ( s ,t)em 2d 



d(l-^) 
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Then 



< 



2&£ E E h t+k e)h P < ' 1 l) sup sup /»<*' Ld' \G k k ,(s,u,t,v)\dudv 



E^Y 

i=i / 



|fe-fc'l>i( s ,t)eK 2d " 



2blc n Y. h p a{1 sup sup_/ Hd // Hd /|G fc)fc /(s,t«,t,i;)|dwd« 



(2) 



E^" 



d(l-£) 



At this point, we distinguish two cases according to small and large values of I. 
If£<E , then 2d(l-£) < d+2 replies B nj2 d(i-t) -> ^2d(i-e) < oo, as n ->■ oo, 

because of H.2(u). It follows that 



26^c n S ni2(2 (i-£) sup sup / Rd , / Rd , |G fe)fe / (s,u,*,t;)| dudu 

|fc-jfc'|>l(s 



n ^n,d(l-£) 



_2d 

h 9 

I l"n. 



nh d n U = O (b 2 n c n h d n ) ■ 

i _ 
and b n = (5 Inn) 6 with S > 4, then 



y~" j ^ Cov (Zjfe )n , Zfc/ jn ) = O I b 2 n h n " I -> 0, as n ->■ oo 



d(2-p) 



which implies that 
Thus, when c n := 

since p > 2. Now, if d > 3, £ G [0, then the term cannot be studied 

previously, because assumption H.2(ii) is not satisfied since 2d(l — £) > d + 2. In this 

1 2 

case, let us consider relation (T5]) and choose a real number £ such that < £ < — . 

/9 — 1 a 

d + 2 

Let us mention that £ exists only if p > — - — . Thus, we have the relation d(£ + 1) < 
d + 2, which implies that B n ^+\) —> Pd(£+i) < oo, as n — > oo, by vertue of H.2(zz). 

n . . n 

Next, since h n decreases on has ^2 K — hf- ^ follows that 



as 



8=1 



i=l 



. n o n ti 1 nh n n n M£ + i-) c n o n n 1 n^l3 n ^ + i 



J3=l 



< 



< 



°n n.d 



^Uhf^Y n2h l 2M K dB n,d 

because < i < =>- 1 — £ — 2£ > 0, as long as £ < 4. Therefore, from d2J) we have 
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The choices c n : 
the covariance term 



h p 



and b n = (5 Inn) 9 with 8 > -r, imply the negligibility of 



(b) Let us consider the decomposition 



Cov [/£(*), 



where, for all integers s, t 



.i=i 



n n 



^ Ajj + ^ ^ A i:j 

i=l i^j j'=l 



i=l 



:= Fi + F 2 . 



A. t := Cov 



1 „(x-X 8 \ m(Y t ) 



hf 



K 



h, ) ' hf U m( - Yi ^ bn tt 



K 



x-Xt 



Next, we proceed as in the proof of (a) and find 

nh d n F x ->■ cr|(x)r(x), and nh d n F 2 ->■ 0, as n ->■ oo. 



□ 



3 Proof of main result 

Proof. To prove the main result, we show that the asymptotic distribution of the principal 
term — r(x)] is normal, while the residual term [r£(x) — is negligible. First, 

observe that if b n = (Sinn)' 5 with 5 > | , then for all e > 0, we have 



P (\<pi(x) -<fi(x)\ > e/^nX) < P > &n}) < nP (\Y \ > b n ) < Ee^^n 1 ' 



A<5 



OO / x 

So, for all e > 0, ^ P ( — <&n{x)\ > £/ \/ n ^n J < oo, and the Borel-Cantelli lemma 
n=l ^ ' ' 

implies that 

\Jnh^ \r n [x) — r n {x)\ — > a.s, as n — > oo. 

One may prove in the same manner that f^(x) — > f(x) a.s as n — > oo. Next, we need to 
show that 

f3 d{1 _ 2e) \\K\\ 2 2 V(x) 



Jnhi [f e n (x) -B n - r(x)] 4 M 



0, 



12 



as n — > oo. To this end, we use the following representation 

1 K f'(r) 

r n (x) -r(x)- B n 



E/£(s) 
-E#(s) 



0n( X ) ~ E( Pn( X ) 

f*(x) - Ef n {x) 



+ 



Now, applying the Cramer- Wold device and remembering that f^(x) -4 f(x), and Ef^(x) — > 
f(x), as n — > oo, the proof of the Theorem [T] is straightforward from the following claim: 



fi(x) - Efi(x) 



AM 0, a\ 



V(x) + r 2 (x) r(x) 
r(x) 1 



as n — > oo. 



This last convergence is equivalent to 



(3) 



y/nh* {Ax [fi(x) - Efi{x)] + A 2 [#(x) - ECp l n {x)] } 4 A/" [0, £, 2 (x)] , as n -+ oo, 

for each Ai, A2 £ K such that Ai + A 2 7^ 0, where 

£ 2 (x) := aj{x) {A 2 + 2AxA 2 r(x) + A 2 [V(x) + r\x)] } . 

Hence, the main result will be completely proven if ([3]) were established. To this end, let us 
set 

V nj := X^nj + X 2 K 



where \l/ 
with 



raj •- 



, d(2l-l) 



2 h 



( K , - EV nj ) and % 3 , 



d(2f-l) 



2 h 



jf± (W nj ~ EW nj ) 



V nj := K 



Xj ^ and W nj := K ( - Xj ) m (Yj) l{| TO (y i )|<b n }- 



Next, consider the sequences c„, r ra , and r ra defined as 



T n := [to log raj , ? n :-- 



(logn) 



and r r 



~l~ 7"n 



with r , ^0 > 0. 



To establish 02]), we use the classical Doob [8J methodology, which consists of splitting the 
term 

V^K { Ai l&(x) - /(*)] + A 2 [fax) - ^(x)J } 
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into large blocks separated by small blocks denned by 



T nm = E ^nj (large blocks) , T' nm = ]T (small blocks) 

j = km j = lm 



T nv n +i = E *nj (rest of term), 

j=N+l 

where N := r n (r„ + <^), and for m = 1, . . . , r n , k m := (m - 1)(<?„ + r n ) + 1, l m := (m - 
l)(<j n + T n ) + Next, let us define the partial sums 



Sni = T nm , S n 2 = T nm and S n s = T nrn+ . 



m=l m=l 

Thus, we can write 

V^4{^i [ffe) - /(*)] + A 2 [f n (x) - <p(x)] } = S nl + S n2 + S n3 . 

The goal is to prove that, ES^ 2 and ES^ 3 converge to zero, while the asymptotic distribution 
of S n \ is normal. First, observe that 

ES 2 n2 = EVarCCJ + 2 £ Cov(T^.) 

171=1 l<^<7<7"n 

= E E Var^ m + 2E E CovU m ,^ nj ) ( 4 ) 

m=l i=l m m=ll m <i<j<l m +T n —l 
k+Tn-1 h+Tn-1 , _ , 

+ 2 E EE Cov(* ns ,* nt ) := A x + A 2 +A 3 . 
The first term in is decomposed as 

Ai = E E [A?Var* ni + A|Var^ + 2A!A 2 Cov (tf ni , ^)] := A u + A 12 + A 13 . 

m=l i=l m 

1 

Since /i n decreases, the choice of 6 n = (5 Inn) 9 with 5 > j-, and > l/<jo with the help of 
H2(iii) implies that 



An + A I2 = J^- t " t~ hf" [>&•"< (^) + ^VarA' (tg) Y,l 

n,d(l-l) m=l - =lm \ ' J \ • J 



r„r„(l+%)||A-|fcmax(Aj,A|) 
'» n,d(l-<) 



^ nh£B 2 ► 0, ES 71 -> 00, 



14 



and, similarly, we have A 13 < 2XlX ^^ T "W K \\oo o, as n — > oo. In the same manner and by 

ntl n n,cf(l-«) 

also using the Cauchy-Schwartz's inequality, we get 

r n rl (1 + b n f \\K\\ 2 max(A?, A|) 



A 2 < ^ 1100 ^^->0, asn^oo. 



The last term in (jlj) is bounded by Billingsley inequality with the help of assumptions H2(iii) 
and H.3(i), as follows. 

k+T n -l lj+T n -l 

As= 2 £ E E {AfCov(^ s ,* nt )+A|Cov(^ s ,^) 

l<i<j'<r n s=ii i=(j 

+ A^fCov^^^+Cov^,^)]} 

< 2(1+M H^r x(Al ' A2)fe " EE E E (h s h t )- de a x [k(, n + r n )] 

"■'d^-Z) k=l j=l S=lj t=lj 

< 2 7 (l+6„) 2 ||^||^max(Af,A|)r n ^ ^ pkTn 
— nh%B 2 ~ ~ 

n n,d(l — l) k=l 

Therefore, 

as long as b n = (<51nn) ? with 5 > |, and > 1/<j . Now, let us prove that ES 2 3 — > as 
n — )• 0. One has 

n 

E^ 3 = J2 Var ^ + 2 ^ Cov(^ m ,^):=e nl + 6 n2 . (5) 

j=N+l N+l<i<j<n 

The variance term 0„i may be written as 

n 

Oni= ^2 [A?Vax* ni + A^VaxKi + 2AiA 2 Cov(* fti ,*^-)] := A?e n ii+A^e n i 2 +2AiA 2 e n i 3 . 

j=N+l 

The first term of the right hand side of the preview decomposition satisfies the relation 

n N 

nh d n Vavf n (x) ~ ^ Var(^) = ^ Var(* ni ) + 9 nll . 



However, one may write 



N 

' Nh% Var 



Ev-(^)=(|| 
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Since iV ~ n, the condition u n ~ v n implies h Un ~ h Vn , which leads to nh d n ~ Nh 1 ^, which 

TV 

together with Lemma U)[c) imply that ^2 Var( 1 I r n j) — )■ erf(x), as n — )■ oo. It follows that 

i=i 

n 

©nil = because Var(^ n j) — >■ <rf (x), asn-^ oo. Let us mention that if £ > 1/2, then 

the condition m„ ~ v n implies h Un ~ h Vn , is not necessary. Indeed, the variance term nl 
can be written as 



y A,,/i i^) / //, \ , . (,./-'' - A, 

i=7V+l 

Since h n is decreasing and £ > ~, then the Toeplitz lemma, with the help of assumption 
H.2(ii) and the convergence 



h^VaxK [ X " * ) -> /(x) / K 2 (x)dx, as z -> cx) 



imply that 

Cste (n - A>) (1 + 6 n ) 2 

- i 

Because of n—N < ^„+r n , it follows that O n i — > as n — > oo, provided b n = (5 In n) 9 with 5 > 
| , and # > l/^o- Also, in the same manner, and by replacing by we can deduce from 
Lemmata) that Q n n = o(l). Finally, the last term n i3 is bounded similarly to the first 
term by using Lemma Mjo)- Therefore, from n i3 = o(l), it follows that 0„i — > as n — > oo. 
Now, let us study the term B„2 in (131) . This can be decomposed as 

6 n2 = 2 J] [A^Cov (* ni , * fti ) + A^Cov <J + 2A1A2C0V (* ni , *' ni )] . 

N+l<i<j<n 

As in the proof of Lemma |2l one may show that 

£ [A?Cov (tt ni , * ni ) + A^Cov (M/V, ^-)] ^ 0, as ^ 

iV+l<i<j<n 

and 

£ Cov (* Bi , *y < £ Cov (* ni , *y ^ 0, as ^ 0. 

iV+<?.<j<n 1<«<J<™ 
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Hence, 0„2 — > 0, as — > 0. To complete the proof we must show that the asymptotic distribu- 
tion of S n i is normal. To this end let us check the Lindeberg- Feller conditions for S n i. First, 
we consider a sequence of iid random variables Z n \, . . . , Z nrn , having the same distribution 
as T nm . Then, EZ nl = and if $T nm is the characteristic function (f.c.) of T nm , then 
is the f.c. of the random variable Y Z nm . To establish the asymptotic normality of S n \, 



m=l 



it suffices to prove that the variables Y Z nm and Y2 T nm have the same distribution, and 

m=l m=l 

that this latter is Gaussian. By the Volkonskii-Rosanov [18] lemma, one has 



E Y[ e itTnm - IJ Ee 



itT n 



m=l 



m=l 



< 8(r n - l)ot(r„) < por n e plTn -)• 0, as n -> oo. 



It follows that |EFIm=i e itTnm - -» 0, as n -y oo. Then, it suffices to prove that $£? m 
converges to the characteristic function of a Gaussian random variable. To this end, we pro- 
ceed as follows. Set Z' nm := ^ 2a , where s 2 n := X)m=i VarZ nm . One has -)■ Ej(x), as n ->■ 
oo. Indeed, s 2 = VarT nm — )■ as n — > oo, because on one hand we have from 

m=l 

Lemmas [T] and [2J 

VarS nl ~ nh d n {\\\axf n {x) + A^Var^Jx) + 2A1A2C0V [/^(x), <^(x)] } -> S, 2 (a;), asn^oo, 
and one may show, on the other hand, as for A 2 , that Cov(T ni , T n j) — y 0, as n — > 00. 

l<i<j<r n 

Hence, the variables Z' nm are iid, EZ' nl = and Y VarZ^ m = 1. By virtue of the Lindeberg 

m=l 

conditions (c.f. Loeve [13]), we have to show that for all e > 0, 

^ E (Z'nmMlZ^e}) 0, as 72 ^ OO. 



Noting that \T nm \ < 



m=l 
x/nhiB, 



n D n,d(l-l) 

r n 



and applying Markov's inequality, one has 



m=l 



m=l 



Ee(Okj> £ })= Ee(|i 



-{\T nm \>es n } j ^ -ahd^I 



< 



^(i+b n y\\K 



2||fl|2 »•« 



n,d(l-f)°™ m=1 



Y^ P i\T nm \ > £s r 



< 



Cn(l + 6n) . II^ILe' 1 

\/ri/i^ s « B n,d(l-«) 



— >• 0, as n — y oo, 



if b n = (5 Inn) 5 with 5 > f, and > 1/V 



□ 
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